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Abstract 

We describe in this paper a connection between bifix codes, interval 
exchange transformations and free groups. This is in the spirit of the 
connection established previously for Sturmian words instead of interval 
exchange transformations. We prove as a main result that for the set F 
of factors of a regular interval exchange transformation, a finite bifix code 
X on the alphabet A is F-maximal of F-degree d if and only if it is the 
basis of a subgroup of degree d of the free group on A. The proof uses the 
fact that, in a regular interval exchange set, the sets of first return words 
form a basis of a free group. 
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1 Introduction 

Interval exchange transformations were introduced by Oseledec [18] following 
an earlier idea of Arnold [2]. These transformations form a generalization of 
rotations of the circle. 

In this paper we study a relation between interval exchange transformations 
and bifix codes. The paper is a continuation of the paper with part of the 
present list of authors on bifix codes and Sturmian words [5]. We understand 
here by Sturmian words the generalization to arbitrary alphabets, often called 
strict episturmian words or Arnoux-Rauzy words (see the survey |13j). of the 
classical Sturmian words on two letters. 

The starting point of this research is the observation that the family of 
Sturmian sets is not closed under decoding by a maximal bifix code, even in the 
more simple case of the code formed of all words of fixed length n. Actually, 
the decoding of the Fibonacci word (which corresponds to a rotation of angle 
a) by blocks of length n is an interval exchange transformation corresponding 
to a rotation of angle na coded onn + 1 intervals. This has lead us to consider 
the set of factors of interval exchange transformations, called interval exchange 
sets. 

One of the results proved in this paper is that the family of regular interval 
exchange sets is closed under decoding by a maximal bifix code (Theorem l5.21[) . 

This result invited us to try to extend to regular interval exchange transfor- 
mations the results relating bifix codes and Sturmian words. 

As a main result, we prove that, under natural hypotheses satisfied by a 
regular interval exchange set F, a finite bifix code X on the alphabet A is F- 
maximal of F-degree d if and only if it is the basis of a subgroup of degree d of 
the free group on A (Theorem 16.11 called below the Basis Theorem) . 

We actually introduce several classes of sets of words on k + 1 letters having 
all kn+1 elements of length n for all n > 0. The smallest class (BS) is formed 
of the Sturmian sets on a binary alphabet, that is with k = 1 (see Figure ITTTj) . 
It is contained both in the class of regular interval exchange sets (denoted RIE) 
and of Sturmian sets (denoted S). The next one is the class of sets satisfying the 
tree condition, called tree sets, which contains the two previous ones. Finally, 
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the class of sets satisfying the neutrality condition called neutral sets contains 
the previous one. 

The proof for regular interval exchange sets uses the property, proved in [7] , 
that the sets of first return words form a basis of the free group (this result is 
referred to below as the Return Theorem). 

We have tried in all the paper to use the weakest of these conditions to prove 
our results. As an example, we prove that, under the neutrality condition, any 
finite F-maximal bifix code of i^-degree d has 1 + d(Card(A) — 1) elements 
(Theorem 13.41 called below the Cardinality Theorem) . 



CT RT BT 

RIE + S yes yes yes 

T ~^es~ ? T~ 

N yes no no 

kn + 1 no no no 



Figure 1.1: The classes of sets on k + 1 letters: Binary Sturmian (BS), Regular 
interval exchange (RIE), Sturmian (S), Tree (T), Neutral (N), of complexity 
kn + 1 

For each class the array on the right of Figure [TTT1 indicates whether (with the 
additional hypothesis of uniform recurrence) it satisfies the Cardinality Theorem 
(CT), the Return Theorem (RT) or the Basis Theorem (BT). All these classes 
are distinct. Moreover, it can be shown that the intersection of RIE and S is 
reduced to BS. Indeed, Sturmian sets on more than two letters are not the 
factors of an interval exchange transformation with each interval labeled by a 
distinct letter (the construction in [3] allows to obtain the Sturmian sets of 3 
letters as an exchange of 7 intervals labeled by 3 letters). The class RIE is 
closed under decoding by a maximal bifix code (Corollary I5.22[) . 

This leaves open therefore the identification of a class of sets containing the 
Sturmian sets and closed under decoding by a maximal bifix code for which the 
Basis Theorem is true. 

Our main result holds for the class of tree sets F on an alphabet A which are 
uniformly recurrent, and satisfy the condition that the set of first right return 
words to any word in F is a basis of the free group on A. 

The paper is organised as follows. 

In Sections [3] and [H we introduce the neutrality and the tree condition. 
These conditions are satisfied by Sturmian sets and, as shown in Section EJ by 
regular interval exchange sets. 
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We also introduce a variant of the tree condition called the planar tree con- 
dition. We prove that both the tree condition and the planar tree condition 
imply a stronger version of themselves (Propositions 14.61 and I4.9P . 

In Section [SI we recall some notions concerning interval exchange transfor- 
mations. We state the result of Keane [TB] which proves that regularity is a 
sufficient condition for the minimality of such a transformation (Theorem I5.3[) . 

We introduce the relation between interval exchange transformations and bi- 
fix codes. We prove that the transformation associated with a finite F-maximal 
bifix code is an interval exchange transformation (Proposition 15.17]) . We also 
prove a result concerning the regularity of this transformation (Theorem 15. 21)) . 

In Section [5] we prove the Basis Theorem characterizing the bases of sub- 
groups of finite index of the free group included in a regular interval exchange 
set F (Theorem 16. 3|) which is proved in [5] for a Sturmian set F. The proof 
follows the same path as the corresponding one for Sturmian sets. Several in- 
termediary results hold under the weaker assumption that the set F satisfies 
the tree condition (Propositions 16.151 and I6.17P . The final argument uses the 
fact that, in a regular interval exchange set, the sets of first return words form 
a basis of the free group (Theorem 4.16 in [7]). 

2 Preliminaries 

In this section, we first recall some definitions concerning words. We give the 
definition of recurrent and uniformly recurrent sets of words. We also give 
the definitions and basic properties of bifix codes (see [5] for a more detailed 
presentation). 

2.1 Recurrent sets 

Let A be a finite nonempty alphabet. All words considered below, unless stated 
explicitly, are supposed to be on the alphabet A. We denote by A* the set of 
all words on A. We denote by 1 or by e the empty word. A set of words is said 
to be factorial if it contains the factors of its elements. 

Let F be a set of words on the alphabet A. For w £ F, we denote 

L(w) = {a £ A | aw £ F} 

R( w ) = { a £ A | wa £ F} 

E(w) = {{a,b) £ A x A | awb £ F} 

and further 

i{w) = Card(i(w)), r(w) = Caxd(R(w)), e(w) = Caxd(E(w)). 

A word w is right- extendable if r(w) > 0, left- extendable if i(w) > and biex- 
tendable if e(w) > 0. A factorial set F is called right-essential (resp. left- 
essential, resp. biessential) if every word in F is right-extendable (resp. left- 
extendable, resp. biextendable). 
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A word w is called right-special if r(w) > 2. It is called left-special if £(w) > 
2. It is called bispecial if it is both right and left-special. 

A set of words is recurrent if it is factorial and if for every u,w G F there is 
a v G F such that uvw G F. A recurrent set F ^ {1} is biessential. 

A set of words F is said to be uniformly recurrent if it is factorial and if, for 
any word u G F, there exists an integer n > 1 such that u is a factor of every 
word of F of length n. A uniformly recurrent set is recurrent. 

A morphism f : A* — > B* is a monoid morphism from A* into B* . If a G A 
is such that the word /(a) begins with a and if |/ n (a)| tends to infinity with 
n, there is a unique infinite word denoted /"(a) which has all words f n (a) as 
prefixes. It is called a fixpoint of the morphism /. 

A morphism / : A* — > A* is called primitive if there is an integer k such 
that for all a, 6 G A, the letter 6 appears in f k (a). If / is a primitive morphism, 
the set of factors of any fixpoint of / is uniformly recurrent (see |12) Proposition 
1.2.3 for example). 

An infinite word is episturmian if the set of its factors is closed under reversal 
and contains for each n at most one word of length n which is right-special. It is 
a strict episturmian word if it has exactly one right-special word of each length 
and moreover each right-special factor u is such that r(u) = Caxd(A). 

A Sturmian set is a set of words which is the set of factors of a strict epis- 
turmian word. Any Sturmian set is uniformly recurrent (see [S]). 

Example 2.1 Let A — {a, b}. The Fibonacci word is the fixpoint x — abaababa . . . 
of the morphism / : A* — > A* defined by f(a) — ab and f(b) = a. It is a Stur- 
mian word (see |17|). The set F(x) of factors of x is the Fibonacci set. 

Example 2.2 Let A = {a, b, c}. The Tribonacci word is the fixpoint x = 
f u {a) — abacaba--- of the morphism / : A* —> A* defined by /(a) = ab, 
f(b) = ac, /(c) = a. It is a strict episturmian word (see [II])- The set F(x) of 
factors of x is the Tribonacci set. 

2.2 Bifix codes 

A prefix code is a set of nonempty words which does not contain any proper 
prefix of its elements. A suffix code is defined symmetrically A bifix code is a 
set which is both a prefix code and a suffix code. 

A coding morphism for a prefix code X c A + is a morphism / : B* — > A* 
which maps bijectively B onto X. 

Let F be a factorial set of words. A prefix code X C F is F-maximal if it is 
not properly contained in any prefix code Y G 

We denote by X* the submonoid generated by X. A set X C F is right 
F -complete if any word of F is a prefix of a word in X* . A prefix code is 
F-maximal if and only if it is right F-complete (Proposition 3.3.2 in [S]). 

Similarly a bifix code X C F is f-maximal if it is not properly contained in 
a bifix code Y C F. For a recurrent set F, a finite bifix code is i^-maximal as a 

1 Note that in this paper we use C to denote the inclusion allowing equality. 
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bifix code if and only if it is an F-maximal prefix code (see [5], Theorem 4.2.2). 
For a uniformly recurrent set F, any finite bifix code X C F is contained in a 
finite F- maximal bifix code (Theorem 4.4.3 in [5]). 

A parse of a word w with respect to a set X is a triple (u, x, u) such that 
ui = vxu where v has no suffix in X, u has no prefix in X and x £ X* . We 
denote by 6x(w) the number of parses of w. By definition, the F-degree of X, 
denoted dp(A), is the maximal number of parses of a word in F. It can be 
finite or infinite. 

Let X be a bifix code. The number of parses of a word w is also equal to the 
number of suffixes of w which have no prefix in X and the number of prefixes 
of w which have no suffix in X (see Proposition 6.1.6 in [5]). 

The set of internal factors of a set of words X, denoted I(X) is the set of 
words w such that there exist nonempty words u, v with uwv G X . 

Let F be a recurrent set and let X be a finite F-maximal bifix code of F- 
degree al. A word w € F is such that Sx(w) < d if and only if it is an internal 
factor of X, that is 



(Theorem 4.2.8 in [5]). Thus any word of X of maximal length has d parses. 
This implies that the F-degree d is finite. 

Example 2.3 Let F be a recurrent set. For any integer n > 1, the set F n A n 
is an F-maximal bifix code of F-degree n. 

3 Neutral sets 

In this section, we introduce the neutrality condition and neutral sets. We prove 
a theorem concerning the cardinality of an F-maximal bifix code in a neutral 
set F (Theorem [O]). 

3.1 The neutrality condition 

We say that a word w € F is neutral with respect to a set of words F if 



We will use the following elementary result. 

Lemma 3.1 Let F be a factorial set. A word w £ F is neutral if and only if 



I{X) = {w e F \ S x (w) < d} 



e(w) — £(w) + r(w) — 1. 



r(w) - 1 = T,aeL(w)( r ( aw ) - !)■ 



Proof. This follows directly from the fact that 
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A biextendable word is called ordinary if E(w) CaxAUixdfor some 
(a, b) <G E(w) (see jS], Chapter 4). If F is biessential any ordinary word is 
neutral. Indeed, one has E(w) — (ax (R(w) \ b)) U ((L(w) \ a) x b) U (a, 6) and 
thus e(w) — £(w) + r(w) — 1. 

For example, in a Sturmian set, any word is ordinary. 

We say that F satisfies the neutrality condition if it is factorial and every 
biextendable word w £ F is neutral. We also say that F is a neutral set. 
The function n i-> Card(F n A n ) is called the complexity of F. 

Proposition 3.2 The complexity of a neutral set F is kn+1 with k = Card(i 7 'n 
A) -I. 

Proof. Assume that F satisfies the neutrality condition. We have Card(F n 
A") = kn + 1 for n < 1. Next, assuming that it is true for n > 1, we have 

Cardan A n+1 ) = ^ e(w) = £ (£(w) + r(w) - 1) 
weA"- 1 nF w£A n - 1 nF 
= Card^ n A n ) + Card(F n A n ) - Card( J F n A n ~ 1 ) 
= fc(n + l) + l 

showing the property for n + 1. ■ 
Note that if the complexity of F is fcn + 1 , then 

E W*) - !) = fc - 

Indeed, we have EseFnA"- 1 ^^) ~ X ) = Card(F n A" ) - C&rd(F n A 71 " 1 ) = fc. 

We now give an example of a set of complexity 2n + 1 on an alphabet with 
three letters which does not satisfy the neutrality condition. 

Example 3.3 Let A = {a,b,c}. The Chacon word on three letters is the 
fixpoint x — f u {a) of the morphism / from A* into itself defined by f(a) = aabc, 
f(b) = be and /(c) = abc. Thus x — aabcaabcbcabc ■ ■ ■ . The Chacon set is the 
set F of factors of x. It is of complexity 2n + 1 (see [T^] Section 5.5.2). 

It does not satisfy the neutrality condition because the word w = abc is such 
that L(w) — {a, c}, R(w) — {a, b} and E(w) = {(a, a), (c, a), (a, b), (c, &)}. Since 
e(w) = £(w) + r(w), the word w is not neutral. 

3.2 The Cardinality Theorem 

The following result is proved in [S] in the less general case of a Sturmian set. 

Theorem 3.4 Let F be a recurrent set containing A and let X C F be a finite 
F -maximal bifix code of F -degree d. If F satisfies the neutrality condition, then 
Card(A) = 1 + d(Card(A) - 1). 
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Note that, for a recurrent neutral set F, a bifix code X C F may be infinite 
since this may happen for a Sturmian set F (see [5]). 

We consider rooted trees with the usual notions of root, node, child and 
parent. The following lemma is well-known as a lemma on trees relating the 
number of its leaves to the sum of the degrees of its internal nodes. 

Lemma 3.5 Let X be a prefix code and let P be the set of its proper prefixes. 
Then Card(X) = 1 + £ peP (r(f>) - 1). 

We order the nodes of a tree from the parent to the child and thus we have 
to < n if to is a descendant of n. 

Lemma 3.6 Let T be a finite tree with root r on a set N of nodes, let d > I, 
and let n, a be functions assigning to each node an integer such that 

(i) for each internal node n, 7r(n) = £ 7t(to) where the sum runs over the 
children of n, 

(ii) for each leaf m of T , one has X)m<n a ( 71 ) = 
Then Y, n eN a(n)ir(n) = dir(r). 

Proof. We use an induction on the number of nodes of T . If T is reduced to 
its root, then d = a(r) implies a(r)7r(r) = dir(r) and the result is true. Assume 
that it holds for trees with less nodes than T . Since T is finite, there is a 
node such that all its children are leaves of T. Let to be such a node. Since 
J2x<n a ( n ) — ot(x) + J2 m <n a ( n ) has the same value for each child x of to, the 
value v = a(x) is the same for all children of to. Let T' be the tree obtained 
from T by deleting all children of to. Let TV' be the set of nodes of T". Let tt' 
be the restriction of tt to N' and let a' be defined by 



a'{n) 



a(n) if n 7^ to 

a(m) + v otherwise. 



It is easy to verify that T', tt' and a' satisfy the same hypotheses. Then 



a(77)7r(77) = 0(77)^(77) + a(77T,)7r(TO) + U7r(a 



E 

riGJV 7i£N'\m x<m 



a' (77)71"' (77) + a(m)7r(m) + v ir(x) 

n€:N'\m x<m 

a' (n)ir' (n) + (a(m) + v)ir(m) 

neN'\m 

a'(n)7r'(n) + a'(m)ir'(m) = a'(n)ir'(n) 



E 

7ieN'\m neN 



whence the result by the induction hypothesis. 

Proof of Theorem \3A[ Let k be larger than the lengths of the words of X. 



8 



Let N be the set of words of F of length at most k. By considering each 
word w as the father of aw for a E A, the set N can be considered as a tree T 
with root the empty word e. The leaves of T are the elements of F of length k. 

For w € N, set 7r(w) = r(w) — 1 and let 




1 if n is a proper prefix of X 
otherwise. 



Let us verify that the conditions of Lemma l3.6l are satisfied. Let u be in N. Then 
by Lemma l3.1i y^ aCL ^ u - ) ('r(aM) — 1) = r(u) — l. This implies that ^2 au£F ir(au) = 
tt(u) showing that condition (i) is satisfied. 

Let wbea leaf of T, that is a word of F of length k. Since k is larger than 
the maximal length of the words of X, the word w is not an internal factor of 
X and thus it has d parses with respect to X and thus has d suffixes which are 
proper prefixes of X. Thus ^2 w<u ce(u) — d. Thus condition (ii) is also satisfied. 

By Lemma 13.61 we have ^3„ gJV Oi(n)ir(n) — chr(e). Let P be the set of 
proper prefixes of X. By definition of a, we have J2 n eN o{n)n(n) = J2 p ep n ip) 
and thus by definition of n, dir(s) — d(Card(A) — 1) = X) P ep( r (p) — !)• By 
Lemma 13.51 we have Card(X) = 1 + Yl P ep( r ip) — 1) an d thus we obtain 
Card(X) = 1 + d(Card(A) — 1) which is the desired conclusion. ■ 

We illustrate Theorem 13.41 in the following example. 

Example 3.7 Let B = {x, y, z, t}. Let G be the set of words on the alphabet 
B represented in Figure l3~Tl on the left. The set G can be extended by adding 
words of length 5 and more, to a recurrent set still denoted G which satisfies 
the neutrality condition (see Example 15. 25p . 

The tree of right-special words is represented on the right in Figure |3~T1 with 
the value of r indicated at each node. The bifix codes Y = {xx, xyx, xz, xt, y, zx, tx} 
and Z = {x,yxy,yxz, zxxz, zxxt,txxz,txy} are G-maximal and have both G- 
degree 2. In agreement with Theorem 13.41 we have Card(F) = Card(iT) = 
1 + 2(Card(B) — 1) = 7. The codes Y and Z are represented in Figure |3"?2"1 The 
right-special proper prefixes p of Y and Z are indicated in black in Figure 13.21 
with the value of r(p) — 1 indicated for each one. In agreement with Lemma 13.61 
the sum of the values of r(p) — 1 is 6 in both cases. 

The following example shows that Theorem 13.41 may be false for a set F of 
complexity kn + 1 on an alphabet with k + 1 letters which docs not satisfy the 
neutrality condition. 

Example 3.8 Consider again the Chacon set F of Example 13.31 Let X — 
F n A 4 and let Y be the F-maximal bifix code of i^-degree 4 represented in 
Figure 13.31 which is obtained from X by internal transformation with respect 
to w = abc (see [S] for the definition of the internal transformation). The bifix 
code Y has 10 elements and thus Theorem 13.41 is not satisfied. 
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Figure 3.1: A set G satisfying the neutrality condition and the tree of right- 
special words. 

4 Tree sets 

We introduce in this section the tree condition and a variant called the planar 
tree condition. Each of these conditions has a stronger version called the strong 
tree (resp. planar tree) condition. We prove that the tree condition implies 
the strong tree condition (Proposition I4.6|) and that the planar tree condition 
implies the strong planar tree condition (Proposition I4.9| ) 




Figure 3.2: Two G-maximal bifix codes of G-degree 2. 
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Figure 3.3: An F-maximal bifix code of F-degree 4. 
4.1 The tree condition 

For a biextendable word w, we consider the undirected graph G(w) on the set 
of vertices which is the disjoint union of L(w) and R(w) with edges the pairs 
(a, b) € E(w). 

Recall that an undirected graph is a tree if it is connected and acyclic. 

We say that F satisfies the tree condition if it is biessential and if for every 
word w € F, the graph G(w) is a tree. We also say that F is a tree set. 

Note that for a biessential set F, the graph G(w) is a tree for any ordinary 
word w (but G(w) can be a tree although w is not ordinary). 

Note that a biessential set F satisfies the tree condition if and only if the 
graph G(w) is a tree for every bispecial word w. Indeed, if w is not bispecial, 
then G(w) is always a tree. 

Example 4.1 A Sturmian set F satisfies the tree condition. Indeed, F is 
biessential and every bispecial word is ordinary. 

Clearly the tree condition implies the neutrality condition. The following 
example shows that the converse is not true. 

Example 4.2 Let A = {a, b, c} and let F be the set of factors of a*{bc, bcbc}a*. 
The set F is biessential. One has F fl A 2 — {aa,ab,bc,cb,ca}. It satisfies the 
neutrality condition. Indeed the empty word is neutral since e(e) = Card(F n 
A 2 ) = 5 = £(e) + r(e) — 1. Next, the only nonempty bispecial words are be 
and a n for n > 1. They are neutral since e(bc) = 3 = £(bc) + r(bc) — 1 and 
e(a") — 3 = £(a n ) + r(a n ) — 1. However, F does not satisfy the tree condition 
since the graph G(e) contains a cycle (see Figure |4~T]) . 

In the last example, the set is not recurrent. We present now an example, due 
to Julien Cassaigne [9], of a uniformly recurrent neutral set which is not a tree 
set. 
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Figure 4.1: The graph G(e) for the set F. 



Example 4.3 Let A = {a, b, c, d} and let a be the morphism from A* into itself 
defined by 

a(a) = ab, a(b) = cda, er(c) = cd, a(d) = abc. 

Let F be the set of factors of the infinite word <r w (a) (see Figure 14.21 on the 
left). Since a is primitive, F is uniformly recurrent. We will show that all 
the nonempty words of F are neutral. We will use some properties of the set 




Figure 4.2: The sets F and G. 



X = cr(A). Observe first X is a suffix code. It has even the stronger property 
that distinct words of X end with distinct letters. The set X is not a prefix 
code but satisfies the following weaker property. If x, x', y £ X and if £ X* are 
such that xy is a prefix of x'y' , then x = x' (the set X said to be weakly prefix). 

As a third property, the set X has synchronizing pairs. A pair u, v of words is 
synchronizing if for all words p, q, if puvq £ X* , then pu, vq £ X* . For example 
(c, a) is a synchronizing pair. 

Note that if (r, s) and (u, v) are synchronizing pairs, then qrstuvw £ F 
implies stu £ X* 

We first show the following properties. 

1. If a left-special word of length at least 5 begins with a (resp. c), it begins 
with abcda (resp. cdabc). 

2. If a right-special word of length at least 5 ends with a (resp. c), it ends 
with abcda (resp. cdabc). 
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Indeed, the left-special words of length at most 5 beginning with a are the 
prefixes of abcda. This implies that any left-special word beginning with a begins 
with abcda. 

The three other assertions can be proved in an analogous way. 

Let us now show that any nonempty bispecial word of F is neutral. We use 
an induction on the length of the word to prove that the graph of a nonempty 
bispecial word is, according to its first and last letter, equal to one of the eight 
graphs of Figure 14.31 The assertion is true for words of length at most 4 since 
a, c, abc and cda are the bispecial words of length at most 4. 




Figure 4.3: The graphs of bispecial words, according to their first and last letter. 

Assume that v is a bispecial word of length at least 5. Assume first that v 
begins and ends with a. As seen previously, v begins and ends with abcda. 
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ab ab 
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cd abc 


ab ab 


cd 
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Figure 4.4: A bispecial word beginning and ending with a. 



Set v = ucda. Since (d, ab) and (&, cd) are synchronizing and since cducda £ 
F, we have u = a(w) with cw G F. Since cv G F, and since (c, a) is synchroniz- 
ing, we have also abcv G F. Thus dw G F (see Figure l4~4l on the left). 

Next, we have vb G F. Since (d, a) is synchronizing, we have wed G F and 
vbc G F (see Figure [4~4l on the right). Similarly, since vc G F, we have wbc G F 
and ved G F (see Figure l4~4l on the left). Thus w is a bispecial word which 
begins and ends with a shorter than v. By induction hypothesis, the graph of 
w is equal to one of the two first graphs of Figure 14.31 In both cases, we have 
cwb, dwe G F and thus dvc, cvb G F. Next dwb G F if and only if eve G F. Thus 
G(w) is one of the graphs if and only if G(v) is the other one. This proves the 
property in this case. 

The other cases are treated similarly. 

Let B = {1, 2, 3} and let r : A* -> B* be defined by 

t{o) = 12, r(b) = 2, r(c) = 3, r{d) = 13. 

Let G be the set of factors of the infinite word T(er"(a)) (see Figure [4721 on the 
right). 

The set Y = r(A) is a prefix code. It is not a suffix code but it is weakly 
suffix, which means that the reversal of Y is weakly prefix. 
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Let g : {a, c}A* n A* {a, c} — > B* be the map defined by 



3t(w) if w begins and ends with a 



3t(w)1 if w begins with a and ends with c 
2t(w) if u> begins with c and ends with a 
2t(w)1 if if begins with c and ends with c 



It can be verified, using the fact that Y is a prefix and weakly suffix code, that 
the set of nonempty bispecial words of G is the union of 2, 31 and of the set 
g(S) where S is the set of nonempty bispecial words of F. One may verify that 
the words of g(S) are neutral. Since the words 2, 31 are also neutral, the set G 
is neutral. It is uniformly recurrent since F is uniformly recurrent. The set G 
does not satisfy the tree condition since the graph G(e) is not connected (see 
Figure IP]), 



4.2 The planar tree condition 

Let <i and <2 be two orders on A. For a set F and a word w E F, we 
say the the graph G(w) is compatible with the orders <i and <i if for any 
(a, b), (c, d) € E(w), one has 



Thus, placing the vertices of L(w) ordered by <i on a line and those of R(w) 
ordered by <2 on a parallel line, the edges of the graph may be drawn as straight 
noncrossing segments, resulting in a planar graph. 

We say that a biessential set F satisfies the planar tree condition with respect 
to two orders <i and <2 on A if for any w E F, the graph G(w) is a tree 
compatible with <i, <2. 

The following example shows that the Tribonacci set does not satisfy the 
planar tree condition. 

Example 4.4 Let F be the Tribonacci set (see examplc l2.2l) . The words a, aba 
and abacaba are bispecial. Thus the words ba, caba are right-special and the 
words ab, abac are left-special. The graphs G(e), G(a) and G(aba) are shown in 
Figure l4~6l One sees easily that it not possible to find two orders on A making 
the three graphs planar. 




© ® 



Figure 4.5: The graph G(e) for the set G. 
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Figure 4.6: The graphs G(e),G(a) and G(aba) in the Tribonacci set. 



4.3 The strong tree condition 

Let F be a set. For w € F, let U C A* be a finite suffix code and let U(w) = 
{££U \£w £ F}. Let V C A* be a finite prefix code and let V(w) = {r G V" | 
w £ f}. Let G[/,v(«j) be the following undirected graph. The set of vertices is 
made of two disjoint copies of U(w) and V(w). The edges are the pairs {£, r) for 
£ € C/(w) and r e V(u>) such that £wr 6 F. The graph G(iu) defined previously 
corresponds to the case where U, V = A. 

Example 4.5 Let F be the Fibonacci set. Let w = a, U — {aa, ba, b} and let 
V = {aa, ab, b}. The graph Gu,v(w) is represented in Figure l4?fl 



We say that F satisfies the strong tree condition if it is biessential and if 
for every triple w, U, V with w € F, U a finite maximal suffix code and V a 
finite maximal prefix code, the graph Gu,v{w) is a tree. Note that we might 
as well use for U a finite _F-maximal suffix code and for V a finite F-maximal 
prefix code. Clearly the strong tree condition implies the tree condition. The 
following result shows that they are actually equivalent. 

Proposition 4.6 If a set satisfies the tree condition, it satisfies the strong tree 
condition. 

The proof uses the following lemma. 

Lemma 4.7 Let F be a biessential set. Let w £ F, let U be a finite suffix 
code and let V be a finite prefix code. Assume that there exists £ € F such 
that £w e F and A£ C U. Set U' = (Z7 \ At) U I. If the graphs Gu>y(w) and 
Ga,v(^ w ) are trees then Guy{w) is a tree. 

Proof. Since F is left-essential, there is a letter a £ A such that a£w £ F and 
thus a£ £ U(w). Let us first show that Gjjy(w) is connected. 

Step 1. As a preliminary step, let us show that for each r £ V such that 
£wr £ F, there is a path from a£ to r in Gu.v(w). Indeed, since the graph 
Ga,v(£w) is connected there is a path from a to r in this graph. Thus there is 
a path from a£ to r in Gtj,v(w). 




Figure 4.7: The graph G V y{w). 
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Step 2. As a second step, let us show that for any m £ U'(w) \ £ and 
r € V(w), there is a path from m to r in G;y.y(u>). Indeed there is a path from 
m to r in Gj/'^u;). For each edge of this path of the form (£, s), by Step 1, 
there is a path from a£ to s in the graph Gjj,v{w). Thus there is a path from 
m to r in Gfy.y(uj). 

Step 3. In the same way as above, for each r G V(w;), there is a path from 
a£ to r in Gu.v{w). 

Consider now m <E U(w) and r 6 V(w). If m ^ yli, then m € U'(w) \ £ and 
thus, by Step 2, there is a path from m to r in Gjjy{w). Next, assume that 
m = W with b <E A. Since G^y^w) is connected, there is a path from b to o 
in Ga.v^w) and thus a path from m to in G[/ i y(w). By Step 3, there is a 
path from a£ to r in Guy(w). Thus there is a path from m to r in Gc/ ; y(w). 

Assume now that Gjjy{w) contains a cycle C. If the cycle does not use a 
vertex in U\ it defines a cycle in the graph Ga,v{£w) obtained by replacing each 
vertex a£ by a vertex a. Since Ga,v{£w) is a tree, this is impossible. If it uses a 
vertex of U' it defines a cycle of the graph Gu 1 y{w) obtained by replacing each 
vertex a£ by £. This is impossible since Gwy{w) is a tree. Thus Guy(w) is a 
tree. ■ 

Proof of Proposition 14.61 Let F be a set satisfying the tree condition. Let U 
be a finite maximal suffix code and let V be a finite maximal prefix code. We 
show by induction on the sum of the lengths of the words in U, V that for any 
w € F, the graph G[/,y(w) is a tree. Let w E F. 

If U,V = A, the property is true since F satisfies the tree condition. 

Otherwise, assume for example that U contains words of length at least 
2. Since U is a finite maximal suffix code, there is a word £ £ A* such that 
Al C U. Let U' = (U \ A£) U £. If £w £ F, then G uy {w) = Gu> tV (w) and thus 
the conclusion holds. 

Otherwise, by induction hypothesis, the graphs Gjj'.v{w) and Ga,v(£w) are 
trees. By lemma l4~7l the graph Guy{w) is a tree. ■ 

The following corollary shows that if one uses codes U, V which are not maximal, 
the corresponding graph will not always be connected but it will still be acyclic. 



Corollary 4.8 If a set F satisfies the tree condition, then for every w G F. 
every finite suffix code U and every finite prefix code V , the graph Gjj,v{w) is 
acyclic. 

Proof. Let U' (resp. V) be a finite maximal suffix (resp. prefix) code containing 
U (resp. V). Since F satisfies, by Proposition 14. 6\ the strong tree condition, 
the graph Gu>,v(w) is a tree. Since Gjj,v{w) is a subgraph of Gu',v{w) it is 
acyclic. ■ 

Given an order < on the alphabet A, the corresponding lexicographic order 
on words is defined by u < v if u is a proper prefix of v or if u — paq, v — pbr for 
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some words p, q, r and letters a, b such that a < b. A reverse lexicographic order 
is defined symmetrically by u < v if u is a proper suffix of i; or if u = qas, v = rbs 
for some words q, r, s and letters a, b such that a < b. 

Let <i be a reverse lexicographic order and <2 be a lexicographic order. For 
a word iu € F, a finite suffix code U C A* and a finite prefix code V C A* , we 
say that the graph Gu,v{w) is compatible with the orders <i and <2 if for any 
edges (u,v), (u',v') of Gu,v(w), one has 



Thus, as for the tree condition, the compatibility with the orders <i and <2 
expresses the fact that Gjj,v{w) may be drawn as a planar tree if U is ordered 
according to <i and V is ordered according to <2- 

We say that a biessential set F satisfies the strong planar tree condition 
with respect to <i and <2 if, for any w G F, any finite maximal suffix code 
U C A* and any finite maximal prefix code V C A*, the graph Gu,v{w) is a 
tree compatible with <i and <2- 

The following result shows that the planar tree condition is equivalent to 
the strong one. We assume given a reverse lexicographic order <i and a lexico- 
graphic order <2- 

Proposition 4.9 // a set satisfies the planar tree condition with respect to <\ 
and <2, it satisfies the strong planar tree condition with respect to the same 
orders. 

Proof. Let Fbea set satisfying the planar tree condition with respect to <i and 
<2 • Let w € F, let U C A* be a finite maximal suffix code and let V C A* be 
a finite maximal prefix code. We show by induction on the sum of the lengths 
of the words in U, V that for any w £ F the graph Gjj,v(w) is a tree compatible 
with the orders <i and <2- 

If U, V — A, the property is true since F satisfies the planar tree condition. 

Otherwise, assume for example that U contains words of length at least 2. 
Since U is a finite maximal suffix code, there is a word i such that Al C U. Let 
U 1 = (U\A£)Ut liiw F, then G uy (w) = G V ',v{w) and thus the conclusion 
holds. Otherwise, by Lemma \4 .71 the graph Gjj,v{w) is a tree. 

Consider two edges (u, v), (u r , v') of Gu,v(w). Assume that u <i u'. We 
show that v <2 v'. This implies that Gu,v(w) is compatible with <i,<2 and 
thus our conclusion. 

If it, v! (f: A£, then u, u' € U' and thus v <2 v' since Gu',v(w) is compatible 
with <i, <2 by induction hypothesis. 

Next, assume that u £ A£ and u' ^ A£. Since <i is a reverse lexicographic 
order, we have £ <i u and thus £ <i u' . Since (£, v) is an edge of Gjj\v (w) and 
since Gjj> ,v(w) is compatible with <x, <2> this implies that v < 2 v'. 

The case u ^ m' £ A£ is symmetrical. 

Assume finally that u, v! £ At. Set u — a£ and u' = a'£. Then (a, u), (a', w') 
are edges of Ga,v(£w). Since by induction hypothesis Ga,v{£w) is compatible 
with <x, <2, we have w <2 v'. m 
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We end this section with a consequence of Proposition 14.61 

Corollary 4.10 Let F be recurrent tree set and let f : B — > X be a coding 
morphism for an F '-maximal finite bifix code X . If F is a tree set, then f~ 1 {F) 
is a tree set. 

Proof. Since F is recurrent, it is biessential. It implies that is also 

biessential. Indeed, let u € f^ 1 (F) and let v = /(it)- Let r,s be words of F 
longer than all words of X such that rvs € F. Let r 1 (resp. s') be the suffix 
of r (resp. the prefix of s) which is in X. Then f~ 1 (r')uf~ 1 (s') is in f~ l (F). 
This shows that f~ 1 (F) is biessential. 

Let u G f^ 1 (F) and let v = f(u). Since F is a tree set, it satisfies the strong 
tree condition by Proposition Since X is a finite _F-maximal bifix code, it is 
both an i^-maximal suffix code and an i^-maximal prefix code. Thus the graph 
Gx,x(v) is a tree. Since G(u) is isomorphic with Gx.x(v), it is also a tree. 
Thus f^ 1 (F) is a tree set. ■ 

The following example shows that, under the hyptheses of Corollary 14.101 when 
F is a Sturmian set, the set f~ 1 (F) need not be Sturmian. 

Example 4.11 Let A = {a, b, c} and let F be the Tribonacci set. The set F is 
Sturmian and thus is a uniformly recurrent tree set. Let X = F n A 2 and let 
/ be the coding morphism for X defined by f(x) — aa, f(y) — ab, f(z) = ac, 
fit) = ba, f(u) = ca. Then, by Corollary 14. 101 the set f^ 1 (F) is a tree set. It 
is not Sturmian because y and t are right-special. 

5 Interval exchange sets 

In this section, we recall the definition and the basic properties of interval ex- 
change transformations. We discuss the relation with bifix codes and we show 
that the class of regular interval exchange sets is closed under inverse images by 
coding morphisms of finite maximal bifix codes (Corollary 15 .22|) . 

5.1 Interval exchange transformations 

Let us recall the definition of an interval exchange transformation (see [10] 
or [8]). 

A semi-interval is a nonempty subset of the real line of the form [a, /?[= 
{z e M | a < z < (3}. Thus it is a left-closed and right-open interval. For two 
semi-intervals A, T, we denote A < T if x < y for any x 6 A and t/eT. 

Let (A, <) be an ordered set. A partition {I a ) a eA °f [0, 1[ in semi-intervals 
is ordered if a < b implies I a < /&. 

Let A be a finite set ordered by two total orders <i and < 2 . Let (I a )aeA 
be a partition of [0, 1[ in semi-intervals ordered for <i. Let A a be the length of 
I a . Let pi a = Z) & < ia A b and v a = Ek 2U ^- Set a a = v a - fx a . The interval 
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exchange transformation relative to (I a )aeA is the map T : [0, 1[— > [0, 1[ defined 

by 

T(z) = z + a a if z G la- 
Observe that the restriction of T to I a is a translation onto J a — T(I a ), that 
fj, a is the right boundary of I a and that v a is the right boundary of J a . We 
additionally denote by 7 the left boundary of I a and by S a the left boundary 
of J a . Thus 

la = [7o,Mo[ 3 = [<5a,^a[- 

Since a <2 b implies J a <2 J&, the family (J a )aeA is a partition of [0, 1[ 
ordered for <2- In particular, the transformation T defines a bijection from 
[0, 1[ onto itself. 

An interval exchange transformation relative to (I a )aeA is also said to be 
on the alphabet A. The values (a a ) a £A are called the translation values of the 
transformation T. 

Example 5.1 Let R be the interval exchange transformation corresponding to 
A = {a, b}, a <i b,b < 2 a, I a — [0, 1— a[, h = [1— a, 1[. The transformation R is 
the rotation of angle a on the semi-interval [0, 1[ defined by R(z) = z + a mod 1. 

Since <i and <2 are total orders, there exists a unique permutation it of A such 
that a <i b if and only if 7r(a) <2 tt(&)- Conversely, < 2 is determined by <i 
and 7r and <i is determined by < 2 and 7r. The permutation tt is said to be 
associated to T. 

If we set A — {a\, a%, ■ ■ ■ , a s } with ai <\ a-i <\ ■ ■ ■ <i a 8 , the pair (A, w) 
formed by the family A = (X a )a<£A and the permutation tt determines the map 
T. We will also denote T as T\^. The transformation T is also said to be an 
s-interval exchange transformation. 

It is easy to verify that the family of s-interval exchange transformations is 
closed by composition and by taking inverses. 

Example 5.2 A 3- interval exchange transformation is represented in Figure lOl 

One has A = {a, b, c} with a <i b <i c and b < 2 c < 2 a. The associated permu- 
tation is the cycle tt — (abc). 




Figure 5.1: A 3-interval exchange transformation 
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5.2 Regular interval exchange transformations 

The orbit of a point z 6 [0, 1 [ is the set {T™(z) | n € Z}. The transformation T 
is said to be minimal if for any z 6 [0, 1[, the orbit of z is dense in [0, 1[. 

Set yl = {<2i, 02, ■ • ■ , a s } with a x <j a-i <\ ... <i a s , fa — jj, ai and <5i = 
S ai . The points 0, fj,i, . . . , /^ s -i form the set of separation points of T, denoted 
Sep(T). 

An interval exchange transformation 7\ i7r is called regular if the orbits of 
the nonzero separation points /ii, . . . , fJ, s -\ are infinite and disjoint. Note that 
the orbit of cannot be disjoint of the others since one has T(/ij) = for some 
i with 1 < i < s. 

A regular interval exchange transformation is also said to satisfy the idoc 
condition (where idoc stands for infinite disjoint orbit condition). 

Note that since 5% = T(/j,i), . . . , S s = T(/x s _i), T is regular if and only if the 
orbits of 62, . • ■ , S s are infinite and disjoint. 

As an example, the 2-interval exchange transformation of Example 15 . 1 1 which 
is the rotation of angle a is regular if and only if a is irrational. The following 
result is due to Keane [16]. 

Theorem 5.3 (Keane) A regular interval exchange transformation is mini- 
mal. 

The converse is not true. Indeed, consider the rotation of angle a with a 
irrational, as a 3-interval exchange transformation with A = (1 — 2a, a, a)and 
7r = (132). The transformation is minimal as any rotation of irrational angle 
but it is not regular since = 1 — 2a, jj,2 = 1 — a and thus ^2 = T(fii). 

The following necessary condition for minimality of an interval exchange 
transformation is useful. A permutation 7r of an ordered set A is called de- 
composable if there exists an element b G A such that the set B of elements 
strictly less than b is nonempty and such that tt(B) = B. Otherwise it is called 
indecomposable. If an interval exchange transformation T = T\ n is minimal, 
the permutation tt is indecomposable. Indeed, if B is a set as above, the set of 
orbits of the points in the set S = U ae s/ a is closed an strictly included in [0, 1[. 

The following example shows that the indecomposablity of n is not sufficient 
for T to be minimal. 

Example 5.4 Let A = {a,b,c} and A be such that A a = A c . Let it be the 
transposition (ac). Then n is indecomposable but T\ >n is not minimal since it 
is the identity on 

5.3 Natural coding 

Let T be an interval exchange transformation relative to (I a ) a ^A- For a given 
real number z G [0, 1[, the natural coding of T relative to z is the infinite word 
Yit{z) = a^ai • • • on the alphabet A defined by 

a n =a if T n {z) G I a . 
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For a word w = bobi ■ ■ ■ fc m _i, let I w be the set 

i w = i bo n T-\i bl ) n . . . n t^+V^-J- (5.1) 

Note that each I w is a semi-interval. Indeed, this is true if w is a letter. Next, 
assume that I w is a semi-interval. Then for any nei, T(I aw ) = T(I a ) D I w is a 
semi-interval since T{I a ) is a semi-interval by definition of an interval exchange 
transformation. Since I aw C 7 , T(I aw ) is a translate of I aw , which is therefore 
also a semi-interval. This proves the property by induction on the length. 
Set J w = T m (I w ). Thus 

J w =T m (I bo )nT m - 1 (I bl )n...DT(I bm _ 1 ). (5.2) 

In particular, we have J a = T(I a ) for a G A. Note that each J w is a semi- 
interval. Indeed, this is true if w is a letter. Next, for any a G A, we have 
T (J wa ) — J w H J . This implies as above that J^q is a semi-interval and 
proves the property by induction. We set by convention I e = J £ = [0, 1[. Then 
one has for any n > 

a n a n+ i ■ ■ ■ a n+m -i = w T n (z) e I w (5.3) 

and 

a n - m a n - m +i ■ ■ ■ a n -i = w T n (z) G J w (5.4) 
Let (a a ) agj 4 be the translation values of T. Note that for any word w, 

Jw = Iw + ol w (5.5) 

with a w = X^Icj 1 ^fcj as one ma y verify by induction on |to| = m. Indeed 
it is true for m = 1. For to > 2, set w — ua with a = & m — l- One has 
T m (i™) = r m - 1 (/„,) + a Q and T" 1 - 1 ^) = + by the induction hypothesis 
and the fact that I w is included in I u . Thus J w — T m (I w ) = I w + a u + a a — 
I w + ot w . Equation (|5.5I) shows in particular that the restriction of T' 1 "' to /„, 
is a translation. 

If T is minimal, one has w G F(Et(z)) if and only if I w ^ 0. Thus the 
set F(T,t(z)) does not depend on z (as for Sturmian words, see [T7]). Since it 
depends only on T, we denote it by F(T). When T is regular (resp. minimal), 
such a set is called a regular interval exchange set (resp. a minimal interval 
exchange set). 

Let X be the set of all St(^) for z G [0, 1[ and let S be the shift on X . Then 
we have the following commutative diagram. 

[0,1[^— [0,1[ 

T T 

Q 

X — — ► X 
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The pair (X, S) is a symbolic dynamical system, formed of a topological space 
X and a continuous transformation S. Such a system is said to be minimal if 
the only closed subsets invariant by S are or S. It is well-known that (X, S) is 
minimal if and only if F(S) is uniformly recurrent (see for example |17) Theorem 
1.5.9). The transformation T is minimal if and only if the symbolic dynamical 
system (X, S) is minimal (see [8] page 392). Thus, we obtain the following 
statement. 

Proposition 5.5 For any minimal interval exchange transformation T , the set 
F(T) is uniformly recurrent. 

Note that for a minimal interval exchange transformation T, the map T,t is 
injective (see [IS] page 30). 

The following is an elementary property of the intervals I u which will be 
used below. We denote by <i the lexicographic order induced by the order <i. 

Proposition 5.6 One has I u <i I v if and only ifu<\v and u is not a prefix 
ofv. 

Proof. For a word u and a letter a, it results from (|5.1[) that I ua = I u CiT~^(I a ). 
Since (I a ) a eA is an ordered partition, this implies that (T' n '(/ U ) (1 I a )aeA is an 
ordered partition of T^(I U ). Since the restriction of T'" to I u is a translation, 
this implies that (I U a)aeA is an ordered partition of /„. Moreover, for two words 
u, v, it results also from (|5.1jl that I uv = I u fl T~\ U \(I V ). Thus I uv C I u - 

Assume that u <\ v and that u is not a prefix of v. Then u = las and 
v = ibt with a, b two letters such that a <i b. Then we have Ig a <i la, with 
I u C ha and I v C la whence /„ <i /„. 

Conversely, assume that I u <i I v . Since I u fl I v = 0, the words u, v cannot 
be comparable for the prefix order. Set u — las and v — ibt with a, b two 
distinct letters. If b <i a, then I v <i I u as we have shown above. Thus a <i b 
which implies u <\v. m 

We denote by <2 the order on A* defined by u <2 v if u is a proper suffix 
of v or if u — waz and v — tbz with a <2 b. Thus <2 is the lexicographic order 
on the reverse of the words induced by the order <2 on the alphabet. 

We denote by 7r the morphism from A* onto itself which extends to A* the 
permutation 7r on A. Then u <2 v if and only if 7r _1 ('u) <i 7r _1 ({i). 

The following statement is the analogue of Proposition 15.61 

Proposition 5.7 Let T\ i7r be an interval exchange transformation. One has 
Ju <2 Jv if and only if u <2 V and u is not a suffix of v. 

Proof. Let (I' a ) a £A be the family of semi-intervals defined by V a — J^^y Then 
the interval exchange transformation T" relative to (I' a ) with translation values 
— a a is the inverse of the transformation T . The semi-intervals I' w defined by 
Equation (15. ip with respect to T' satisfy I' w — J^t™) or equivalently J w = 
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K-i&y Thus > J u <2 Jv if and only if I'^-i^ <i ^-1(5) if and only if ( by 
Proposition 15 -6[) n (u) <i 7r _1 (u) or equivalently u <2 v. ■ 

A map 7r : ^4* — > [0, 1] such that 7r(e) = 1 and for any word w 

7r(aw) = 7r(wa) = 7r(w). 

a£A aeA 

is called an invariant probability distribution on ^4* . 

Let 2~A )lr be an interval exchange transformation. For any word w G A*, 
denote by the length of the semi-interval /„, defined by Equation (15. ip . Set 
A(w) = |Zu,|. Then A(e) = 1 and for any word w 

X(aw) = A(wa) = A(u>). 
Thus A is an invariant probability distribution. 

Example 5.8 Let F be the set of factors of the Fibonacci word. It is the 
natural coding of the rotation of angle a = (3 — V5)/2 with respect to a (see [T7] , 
Chapter 2). The values of the map A on the words of length at most 4 in F are 
indicated in Figure l5~2l 



(T^)-^(r^2^-^r^^ 




Figure 5.2: The invariant probability distribution on the Fibonacci set. 

The following result is proved in [11] with a converse (see below). 

Proposition 5.9 Let T be an interval exchange transformation on A ordered 
by <i and < 2 . If T is regular, the set F(T) satisfies the planar tree condition 
with respect to < 2 and <i. 

Proof. Assume that T is a regular interval exchange transformation relative 
to (I a , a a ) a £A- Let x be the natural coding of T relative to z £ [0, 1[ and let 
F = F{x). 

Since T is minimal, w is in F if and only if I w ^ 0. Thus, one has 
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(i) b e R(w) if and only if I w n T~H(7 6 ) ^ and 

(ii) a € if and only if J a n I K , ^ 0. 

Condition (i) holds because I w b = I w H T~\ w \(Ib) and condition (ii) because 
law = la H r _1 (/u,), which implies T(I aw ) — JaC\ I w . In particular, (i) implies 
that (I w b)b£R(w) is an ordered partition of I w with respect to <i. 

For a, a' £ L(w), there is a unique path in G(w) from a to a' which is the 
sequence ai,b\, ...a n with a\ — a and a n = a' with a\ <2 «2 <2 ■• ■ <2 a n , 
61 <i 6 2 <i ■ • • <i &n-i an d J a , n 7^ 0, J ai+1 n ^0forl<i<7i-l 
(see Figure I5.3P . Note that the hypothesis that T is regular is needed here 
since otherwise the right boundary of J ai could be the left boundary of 
Thus G(w) is a tree. It is compatible with <2, <i since the above shows that 
a < 2 a' implies that the letters &i,6„_i such that (a, (a', 6„-i) € £(«;) 
satisfy 61 <i 6 n -i- ■ 



o- 

o o o 



-o 



o- 



-o 



Jo 2 



-o 



o- 



^Cln — l 



Figure 5.3: A path from ai to a„ in G(ui) 
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Since a set satisfying the tree condition satisfies the neutrality condition, we 
obtain the following well-known corollary. 

Corollary 5.10 If F is a regular interval exchange set, then Card(_F (1 A n ) = 
n(Card(A) - 1) + 1 for all n > 0. 

The main result of states that a uniformly recurrent set F on an alphabet 
A is a regular interval exchange set if and only if A C F and there exists two 
orders <i and <2 on A such that the following conditions arc satisfied for any 
word w € F. 

(i) The set L(w) (resp. R{w)) is formed of consecutive elements for the order 
< 2 (resp. <i). 

(ii) For (a, b), (c, d) € E(w), if a < 2 c, then b <i d. 

(iii) If a, 6 € i(w) are consecutive for the order <2, then the set R(aw)nR(bw) 
is a singleton. 

It is easy to see that a biessential set F containing A satisfies (ii) and (iii) 
if and only if it satisfies the planar tree condition. Actually, in this case, it 
automatically satisfies also condition (i). Indeed, let us consider a word w and 
a, 6, c £ A with a <i b <i c such that wa,wc £ F but wb F. Since b £ F 
there is a (possibly empty) suffix v of w such that vb £ F. We choose v of 
maximal length. Since wb ^ F, we have w = uv with it nonempty. Let c? be 
the last letter of u. Then we have dva, dvc £ F and dvb ^ F. Since G(f ) is a 
tree and b £ R(v), there is a letter e £ L(v) such that eu6 £ F. But e <2 d and 
d <2 e are both impossible since G(i>) is compatible with <2 and <i. Thus we 
reach a contradiction. 

This shows the following reformulation of the main result of [TTj . 
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Theorem 5.11 (Ferenczi, Zamboni) A set F is a regular interval exchange 
set on the alphabet A if and only if it is uniformly recurrent, contains A and 
satisfies the planar tree condition. 

The next example shows that there are uniformly recurrent tree sets which 
are neither Sturmian nor regular interval exchange sets. 

Example 5.12 Let F be the Tribonacci set on the alphabet A — {a,b,c} and 
let / : {x, y, z, t, u} be the coding morphism for X = F n A 2 defined in Ex- 
ample 0TTT] The set G = f^ 1 (F) is a tree set which is not Sturmian (see 
Example 14. lip . It is not a regular interval exchange set. Indeed, for any right- 
special word w of G, one has r(w) = 3. This is not possible in a regular interval 
exchange set T since, £t being injective, the length of the interval J w tends to 
as \w\ tends to infinity. 

5.4 Bifix codes and interval exchanges 

The following result is a particular case of a result from [5] (Proposition 3.3.4). 

Proposition 5.13 Let F be a recurrent set and let A be an invariant prob- 
ability distribution on F . For any finite F -maximal prefix code X, one has 

Let T be a minimal interval exchange transformation relative to (I a )aeA- 
Set F = F(T). 

The following statement is connected with Proposition l5.13l 

Proposition 5.14 Let X be a finite F -maximal prefix code ordered by <\. The 
family (L w ) we x is an ordered partition of [0, 1[. 

Proof. By Proposition 15. 61 the sets (I w ) for w £ X are pairwise disjoint. Let 
A be the invariant probability distribution on F defined by X(w) — \I W \. By 
Proposition 15.131 we have J2 w& x ^( w ) = 1- Thus the family (I w ) w ex is a 
partition of [0, 1[. By Proposition 15.61 it is an ordered partition. ■ 

Example 5.15 Let T be the rotation of angle a = (3 — y/5)/2. The set F = 
F(T) is the Fibonacci set. The set X = {aa, ab, b} is an F-maximal prefix code 
(see Figure l5Tf . The partition of [0, 1[ corresponding to X is 

I aa = [0, 1 - 2a[, Iab = [l-2a,l-a[, 7 6 = [l-a,l[. 

The values of the lengths of the semi-intervals (the invariant probability distri- 
bution) can also be read on Figure 15.21 

A symmetric statement holds for an F-maximal suffix code, namely that the 
family (J w ) w ^x is an ordered partition of [0, 1[ for the order <2 on X. 
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Figure 5.4: The prefix code X 



Proposition 5.16 If X is a finite F '-maximal bifix code, the families (I w )wex 
and (Jw)wex are ordered partitions of [0, 1[, relatively to the orders <i and <2 
respectively. 

Proof. This results from Proposition 15 . 141 and its symmetric and the fact that a 
finite F-maximal bifix code is both an i^-maximal prehx code and an .F-maximal 
suffix code. ■ 

Let T be a minimal interval exchange transformation relative to (I a )aeA- Let 
(&a)aeA be the translation values of T. Let x be an infinite word on A which is 
a natural coding of T. Set F = F(x). Let X be a finite F-maximal bifix code 
on the alphabet A. 

Let Tx be the transformation on [0, 1[ defined by 

T x (z) = tM{z) if zel u 

with MgX. The transformation is well-defined since, by Proposition 15. 161 the 
family (I u )uex is a partition of [0, 1[. 

Let / : B* — > A* be a coding morphism for X . Let (K^jbeB be the family 
of semi-intervals indexed by the alphabet B with K & = Ifn,) ■ We consider B as 
ordered by the orders <i and < 2 induced by /. Let Tf be the interval exchange 
transformation relative to (K^^b- Its translation values are /3b = X)j=o aa i 
for f(b) — aodi • • • <x m _i.The transformation Tf is called the transformation 
associated with /. 

Proposition 5.17 One has Tf = Tx- 

Proof. By Proposition 15.161 the family (K^^b is a partition of [0, 1[ ordered 
by <i. For any w € X, we have by Equation (|5.5[) J w = I w + a w and thus 
Tx is the interval exchange transformation relative to (K^^b with translation 
values /3ft. ■ 

In the sequel, under the hypotheses of Proposition I5.17| we consider Tf as an 
interval exchange transformation. In particular, the natural coding of Tf relative 
to z £ [0, 1[ is well-defined. 

Example 5.18 Let F be the Fibonacci set. It is the set of factors of the 
Fibonacci word, which is a natural coding of the rotation of angle a = (3— a/5) /2 
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relative to a (see Example I5.8[) . Let X = {aa,ab,ba} and let / be the coding 
morphism defined by f(u) = aa, f(v) = ab, f(w) = ba. The two partitions of 
[0, 1[ corresponding to Tt are 

7 u = [0,l-2a[, I v = [l-2a,l-a[ I w = [1 - a,l[ 

and 

J v = [0,a[, J w = [a,2a[ J„ = [2a,l[ 
The first row of Table [5~ll gives the two orders on X. The next two rows give 



(*,<l) 


(X,< 2 ) 


aa, ab, ba 


ab, ba, aa 


a, baab, bab 


bab, baab, a 


aa, aba, b 


b, aba, aa 



Table 5.1: The two orders on the three F-maximal bifix codes of .F-degree 2 

the two orders for each of the two other i^-maximal bifix codes of F-degree 2. 

Let T be an minimal interval exchange transformation on the alphabet A. 
Let x be the natural coding of T relative to some z £ [0, 1[. Set F — F(x). Let 
X be a finite F-maximal bifix code. Let / : B* — > A* be a morphism which 
maps bijectively B onto X. The decoding of the infinite word x with respect to 
/ is the infinite word y on the alphabet B such that x — f(y). 

Proposition 5.19 The decoding of x with respect to f is the natural coding of 
the transformation associated to f relative to z: Y,t{z) = f(Y.T f (z)). 

Proof. Let y = 60&1 ■ ■ • be the decoding of x with respect to /. Set xi = f{bi) 
for i > 0. Then, for any n > 0, we have 

Tf{z) = T^[z) (5.6) 

with u n — xo ■ ■ ■ x n -\. Indeed, this is is true for n — 0. Next Tj? +1 (z) = Tf(y) 
with y = TJ{z). Arguing by induction, we have y = T' M,l '(z). Since x = 

u n x n x n +i ■ ■ ■ , y is in I Xn by (|5.3p . Thus by definition of Tf, Tf(y) = T' x "l(y) 
and we obtain T" +1 (z) = T^(T^(z)) = T^+^(z) proving ([576]) . Finally, 
for u = f(b) with b £ B, 

b n = b^ x n =u^ T^(z) £ I u Tf{z) £l u = K b . 

showing that y is the natural coding of Tf relative to z. 

■ 

Example 5.20 Let T, a, X and / be as in Example l5.18[ Let x = abaababa ■ ■ ■ 
be the Fibonacci word. We have x = St (a). The decoding of x with respect to 
/ is y = vuwwv 
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5.5 Bifix codes and regular transformations 

The following result shows that for the coding morphism / of a finite F-maximal 
bifix code, the map T i— > Tf preserves the regularity of the transformation. 

Theorem 5.21 Let T be a regular interval exchange transformation and let 
F = F(T). For any finite F -maximal bifix code X with coding morphism f , the 
transformation Tf is regular. 

Proof. Set A = {a%, a-z, . . . , a s } with a\ <i 02 <i ••• <i a s . We denote 
Si = S ai . By hypothesis, the orbits of 62, ■ ■ ■ ,S S are infinite and disjoint. Set 
X = {x\, a?2, • • • , xt} with x\ <\ X2 <i • • ■ <i xt- Let d be the -F-degree of X. 

For x £ X, denote by 5 X the left boundary of the semi-interval J x . For each 
x € X, it follows from Equation (|5.2[) . that there is an i € {1, . . . , s} such that 
8 X = T k {8i) with < k < \x\. For 1 < i < s, let 

I, = {1 £ I I 4 = T fc (<y, for some fc with < k < \x\} 

We will verify that the family (Xi)2<i< s is a partition of X \ x\ in sets with 
d elements. Let us prove first that X \ x\ — Uj =2 Xi. Indeed, set Y — Uf =2 Xi. 

For x e Xi, let k with < k < \x\ be such that 5 X = T k (Si). If k < 
\x\-l, then S x = T k (S!) = T k+1 (0) since 61 = T(0). Since = Si for some 

1 £ {2, . . . , s}, we have iGFin this case. Finally, if i = 1 and fe = |a;| — 1, then 
^ = T fe (<5i) = Tl x l(0), which implies that x = x x . 

Next, since T is regular, the sets Xi are pairwise disjoint. Finally, for each 
x £ Xi, let k be such that S x = T k (Si) and < k < \x\. Then, by Equation (|5.4|) . 
the prefix of length A: of (Si ) is a proper suffix of X . Since {Si ) has d prefixes 
which are proper suffixes of X , this implies that Card(JQ) < d. Thus 

s 

Card(X) - 1 < ^ Card(Xi) < d(s - 1) 

i=2 

Since Card(X) = c?(s — 1) + 1 by Theorem 13.41 this implies that each Xi for 

2 < i < s has d elements. 

We will now finally show that the orbits of S X2 , . . . , 5 Xt for the transformation 
Tf are infinite and disjoint. Assume that S Xp = TJ(5 Xq ) for some p,q £ {2, . . . , t} 
and n £ 7L. Interchanging p, q if necessary, we may assume that n > 0. Let 
i, j £ {2, . . . , s} be such that 8 Xp = T k (8i) with < k < \x p \ and S Xq = T e {Sj) 
with < I < \x q \. Since T k {S-) = Tf(T £ (Sj)) = T m+i (Sj)) for some m > 0, 
we cannot have i ^ j since otherwise the orbits of Si, 5j for the transformation 
T intersect. Thus i = j. Since S Xp — T k (8i), we have £t(<5i) = uT,t(S Xp ) with 
\u\ = k, u proper suffix of x p . And since S Xp = T^(S Xq ), we have ~E,t{S Xp ) = 
xY,T{S Xq ) with x £ X* . Since on the other hand 5 Xq — T l (Si), we have Et(5i) = 
vYiT(8 Xq ) with \v\ = £ and v a proper suffix of x q . But since t> cannot have a 
suffix in X , v = ux implies x = 1 and thus n = and p = q. This concludes the 
proof. ■ 
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The following corollary shows that the class of regular interval exchange sets is 
closed under taking inverse images by coding morphisms of maximal bifix codes. 



Corollary 5.22 Let F be a regular interval exchange set. If f is a coding 
morphism of a finite F-maximal bifix code the set f^ 1 (F) is a regular interval 
exchange set. 

Proof. Let T be a regular interval exchange transformation such that F = F(T). 
By Theorem 15.211 Tt is a regular interval exchange transformation. We show 
that f^ 1 (F) — F(Tf), which implies the conclusion. 

Let x = St(z) for some z € [0, 1[ and let y = f~ 1 (x). Then F = F(x) and 
F(Tf) = F(y). For any w € F(y), we have f(w) <E F(x) and thus w £ f~ 1 (F). 
This shows that F(T f ) C / _1 (F). Conversely, let w G f^(F) and let v = f(w). 
Since F = F(x), there is a word u such that uv is a prefix of x. Set z' = Tl u ' (z) 
and x' — Y>t{z'). Then v is a prefix of x' and w is a prefix of y' = f^ 1 {x'). 
Since Tf is regular, it is minimal and thus F(y') = F(Tf). This implies that 
weF(T f ). 

Since a regular interval exchange set is uniformly recurrent, Corollary 15.221 
implies in particular that if F is regular interval exchange set and / a coding 
morphism of a finite F-maximal bifix code, then f^ 1 (F) is uniformly recurrent. 
This is not true for an arbitrary uniformly recurrent set F, as shown by the 
following example. 

Example 5.23 Set A = {a, b} and B = {u,v}. Let F be the set of factors 
of (ab)* and let / : B* — > A* be defined by f(u) = ab and f(v) = ba. Then 
f^ 1 (F) = u* U v * which is not recurrent. 

Note in contrast that, by a result proved in [I], if a; is a recurrent word and 
/ : B* — s- A* is an injective morphism such that f(B) — F(x) n A n for some 
n > 1, then f~ 1 (x) is a recurrent word (Theorem 10.8.7). 

We illustrate the proof of Theorem 15.211 in the following example. 

Example 5.24 Let T be the rotation of angle a = (3 — VE)/2. The set F — 
F(T) is the Fibonacci set. Let X — {a, baab, babaabab, babaabaabab} . The set 
X is an F-maximal bifix code of F-degree 3 (see [5]). The values of the fi Xi and 
5 Xi are represented in Figure 1531 



f^Xl \^X1 t^Xzt^XA 

o o o o o 

3x2 ^Xi 1 

o o o o o 

Figure 5.5: The transformation associated with a bifix code of F-degree 3. 
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The infinite word £r(0) is represented in Figure \bM The value indicated 
on the word £r(0) after a prefix u is Tl"l(0). The three values S Xi ,S X2 ,S X3 
correspond to the three prefixes of £t(0) which are proper suffixes of X. 



Et(0)= 



a a 



Figure 5.6: The infinite word Ey(0). 



Example 5.25 Let F be the Fibonacci set and let X C F be the F-maximal 
bifix code of -F-degree 3 defined by X = {a,baabaab,baabab,babaab}. We con- 
sider the alphabet B = {x,y, z,t} and the morphism / : B* — >• A* defined by 
f(x) — a, f(y) — baabaab, f(z) — baabab, fit) = babaab. The set G = f^ 1 (F) 
is the set considered in Example 13.71 It is a regular interval exchange set and 
thus both recurrent and neutral. 



6 Finite index basis sets 

In this section, we state and prove the main result of this paper (Theorem l6.ip . 
The proof uses a graph called the incidence graph of a bifix code. We prove 
that if F is a tree set, the incidence graph of a bifix code X C F is acyclic 
(Proposition 16. 15[) . 

6.1 Main result 

Let F be a recurrent set. For w £ F, let Tp(w) — {x £ F | vox £ F n A + w} be 
the set of right return words to w and let Rf(w) = Tp(w) \ Tp(w)A + be the 
set of first right return words to w. 

Note that a recurrent set F is uniformly recurrent if and only if the set 
Rf{w) is finite for any w £ F. Indeed, if N is the maximal length of the words 
in Rp(w) for a word w of length n, then any word of length N + n contains an 
occurrence of w. The converse is obvious. 

We denote by (X) the subgroup generated by a set of words X. 

We say that F has the finite index basis property if the following holds. A 
finite bifix code X C F is an F-maximal bifix code of F-degree d if and only 
if it a basis of a subgroup of index d of the free group on A. Moreover, in this 
case, X* n F = (X) n F. We also say that F is a finite index basis set. 

We will prove the following result. 

Theorem 6.1 Any recurrent set F on the alphabet A such that 
(i) F is a tree set, 
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(ii) the set of first right return words to any w £ F is a basis of the free group 
on A 

has the finite index basis property. 

Note that the Cardinality Theorem (Theorem I3.4|) holds for a set F satis- 
fying the finite index basis property. Indeed, by Schreier's formula a basis of a 
subgroup of index d of a free group on s generators has (s — l)d + 1 elements 
(actually we use Theorem l3.4l in the proof of Theorem 16. ip . 

Note also that, by the above remark on first return words, condition (ii) 
implies that the set F is not only recurrent but also uniformly recurrent. 

Before proving Theorem 16.11 we first list some important corollaries. The 
first one is the main result of [5]. 

Corollary 6.2 A Sturmian set has the finite index basis property. 

Proof. A Sturmian set is uniformly recurrent. It satisfies condition (i) since all 
its words are ordinary and condition (ii) by Theorem 6.5.2 of [5]. ■ 

The second corollary is the main result of this paper. 

Corollary 6.3 A regular interval exchange set has the finite index basis prop- 
erty. 

Proof. A regular exchange set is minimal by Theorem 15.31 and thus uniformly 
recurrent by Proposition 15.51 It satisfies the tree property by Proposition 15.91 
Finally, it satisfies condition (ii) by Theorem 4.16 of [7]. ■ 

The following examples shows that Theorem 16.31 may be false for a set F 
which does not satisfy some of the hypotheses. 

The first example is a uniformly recurrent set which is not neutral. 

Example 6.4 Let F be the Chacon set (see Example I3.3[) . We have seen that 
F is not neutral and thus not a tree set. The set F l~l A 2 = {aa, ab, be, ca, cb} is 
an ^-maximal bifix code of _F-degree 2. It is not a basis since ca(aa)~ 1 ab = cb. 
Thus F does not satisfy the finite index basis property. 

In the second example, the set is neutral but not a tree set and is not uniformly 
recurrent. 

Example 6.5 Let F be the set of Example 14.21 It is not a tree set (and it is 
not either uniformly recurrent). The set F n A 2 is the same as in the Chacon 
set. Thus F does not satisfy the finite index basis property. 

In the last example we have a uniformly recurrent set which is neutral but not 
a tree set. 
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Example 6.6 Let G be the set on the alphabet B = {1, 2, 3} of Example 14.31 
We have seen that G is neutral but not a tree set. Thus condition (i) of Theo- 
rem [HII] is not satisfied. 

One has R G {2) = {2,312,31312}. The group generated by R G {2) has the 
basis {31, 2}, which is not a basis of the free group on {1, 2, 3}. Thus condition 
(ii) is not satisfied either. Let X = G n B 2 . We have X = {12, 13,22,23,31}. 
The set X is not a basis since 13 = 12(22) _1 23. Thus F does not satisfy the 
finite index basis property. 

We have no example of a uniformly recurrent tree set which does not have the 
finite index basis property. 

We close this section with a property showing that condition (i) in Theo- 
rem [H3] is necessary. 

Proposition 6.7 A biessential set F such that FC\A n is a basis of the subgroup 
(A n ) for all n > 1 is a tree set. 

Proof. Set k = Card(A) — 1. Since A n generates a subgroup of index n, the 
hypothesis implies that Card(^4" n F) — kn + 1 for all n > 1 . Consider w G F 
and set m = \w\. The set X = Aw A n F is included in F n A m+ ' 2 . Thus X is a 
basis of the subgroup (X) . 

This implies that the graph G(w) is acyclic. Indeed, assume that (oi, b\, . . . , 
a p , bp) is a cycle in G(w) with p > 2, a; € L(w), b% € R(w) for 1 < i < p and 
oi ^ Op. Then a\wb\, 02W61, . . . , a p wb p , a\wb p G X. But 

aiti;&i(a2wi)i) -1 a2«^2 • • ■ a p wb p (aiwbp)~ 1 = 1 

contradicting the fact that X is a basis. 

Since G(w) is an acyclic graph with £(w) +r(w) vertices and e(w) edges, we 
have e(w) < £(w) + r(w) — 1. But then 

Card(A m+2 n F) = J2 e H ^ Yl {t{w) + r{w) - 1) 

■w£A m nF w£A m nF 

< 2 Card(A m+1 OF) — Card(A m n F) 

< fe(m + 2) + l. 

Since Card(A m+2 n F) = fc(m + 2) + 1, we have e(w) = ^(w) + r(w) - 1 for 
all w G ^4 m . This implies that G(w) is a tree for all w E F. Thus F is a tree 
set. ■ 

6.2 Automata 

We denote A = (Q,i,T) a deterministic automaton with Q as set of states, 
i G Q as initial state and T C Q as set of terminal states. For p G Q and 
u? G A* , we denote p ■ w = q if there is a path labeled w from j? to the state g 
and p ■ w = otherwise. 
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The set recognized by the automaton is the set of words w € A* such that 
i ■ w € T. A set of words is rational if is recognized by a finite automaton. 

All automata considered in this paper are deterministic and we call them 
simply automata. 

The automaton A is trim if for any q £ Q, there is a path from i to q and a 
path from q to some t E T. 

An automaton is called simple if it is trim and if it has a unique terminal 
state which coincides with the initial state. 

An automaton A = (Q, i, T) is complete if for any state p E Q and any letter 
a € A, one has p • a 7^ 0. 

For a set A C A*, we denote by .-4(A) the minimal automaton of A. The 
states of ^4(A) are the nonempty sets u~ 1 X = {v E A* \ uv E A} for u E A* . 
For u E A* and a E A, one defines (u X) • a = (ua)~ 1 X . The initial state is 
the set X and the terminal states are the sets u~ 1 X for u E X . 

Let X C A* be a prefix code. Then there is a simple automaton A — (Q, 1, 1) 
that recognizes A*. Moreover, the minimal automaton of A* is simple. 

Let A be a prefix code and let P be the set of proper prefixes of A. The 
literal automaton of A* is the simple automaton A = (P, 1, 1) with transitions 
defined for p E P and a E A by 

{pa ii pa E P 1 
1 if pa E X , 
otherwise. 

One verifies that this automaton recognizes A*. 

A simple automaton A = (Q, 1, 1) is said to be reversible if for any a E A, 
the partial map <p^(a) : p i— > p- a is injective. This condition allows to construct 
the reversal of the automaton as follows: whenever q ■ a = p in A, then p- a = q 
in the reversal automaton. The state 1 is the initial and the unique terminal 
state of this automaton. Any reversible automaton is minimal [115] ■ The set 
recognized by a reversible automaton is generated by a bifix code. 

An automaton A = (Q, f , 1) is a group automaton if for any a E A the map 
<PA( a ) '■ P l— ► P'Q is a permutation of Q. When Q is finite, a reversible automaton 
which is complete is a group automaton. 

The following result is from Q15] (see also Exercise 6.1.2 in [B]). 

Proposition 6.8 Let X c A + be a bifix code. The following conditions are 
equivalent. 

(i) A* = (A) n A*; 

(ii) the minimal automaton of A* is reversible. 

Let A = (Q,i,T) be a deterministic automaton. A generalized path is a 
sequence (po> flijPij fl2> • • • >Pn-i) o, n ,p n ) with € iUi" 1 and Pi E Q, such 
that for 1 < i < n, one has • a, = if a, € A and • eij = if a* G A -1 . 
The ?a6eZ of the generalized path is the element a\02 • • • a n of the free group A° . 
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Note that if A — (Q, 1, 1), the set of labels of generalized paths from 1 to 1 in 
A is a subgroup of A°. It is called the subgroup described by A. 

A path in an automaton is a particular case of a generalized path. In the 
case where A has a unique terminal state which is equal to the initial state, the 
submonoid of A* recognized by A is contained in the subgroup of A° described 
by A. 

Example 6.9 Let A = (Q, 1, 1) be the automaton defined by Q = {1,2}, 
1 • a = 1 • b = 2 and 2 • a = 2 • b = 0. The submonoid recognized by A is {1}. 
The subgroup described by A is the cyclic group generated by ab^ 1 . 

The following result is proved in [5]. 

Proposition 6.10 Let A be a simple automaton and let X be the prefix code 
generating the submonoid recognized by A. The subgroup described by A is 
generated by X. If moreover A is reversible, then X* — (X) n A* . 

For any subgroup H of A° , the submonoid H <~) A* is generated by a bifix 
code. A subgroup H of A° is positively generated if there is a set X <Z A* which 
generates H . In this case, the set H n A* generates the subgroup H. Let X be 
the bifix code which generates the submonoid H n A* . Then X generates the 
subgroup H. This shows that, for a positively generated subgroup H, there is 
a bifix code which generates H. The following result is proved in [5]. 

Proposition 6.11 For any positively generated subgroup H of A° , there is a 
unique reversible automaton A such that H is the subgroup described by A. 

The reversible automaton A such that H is the subgroup described by A is 
called the Stallings automaton of the subgroup H. It can also be defined for a 
subgroup which is not positively generated (see [4] or [15]). 

The following result is proved in [5]. 

Proposition 6.12 The following conditions are equivalent for a submonoid M 
of A*. 

(i) M is recognized by a group automaton with d states. 

(ii) M = <p {K), where K is a subgroup of index d of a group G and tp is a 
surjective morphism from A* onto G. 

(iii) M = H n A* , where H is a subgroup of index d of A° . 

If one of these conditions holds, the minimal generating set of M is a maximal 
bifix code of degree d. 

A bifix code Z such that Z* satisfies one of the equivalent conditions of the 
proposition 16.121 is called a group code. 

The following proposition shows in particular that a subgroup of finite index 
is positively generated. For a proof, see [5]. 
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Proposition 6.13 Let H be a subgroup of finite index of A . The minimal 
automaton A of H D A* is a group automaton which describes the subgroup H. 
Let X be the group code such that A recognizes X* . The subgroup generated by 
X is H. 

Example 6.14 The set A d is a group code by Proposition 16. 12f ii) . Thus it is 
a maximal bihx code of degree d. The intersection of the subgroup generated 
by A d with A* is the submonoid generated by A d (Proposition I6.13[) . It is 
composed of the words with length a multiple of d. 

6.3 Incidence graph 

Let X be a set, let P be the set of its proper prefixes and S be the set of 
its proper suffixes. Set P' = P \ 1 and S' = S \ 1. Recall from [5] that the 
incidence graph of X is the undirected graph G defined as follows. The set of 
vertices is the disjoint union of P' and S'. The edges of G are the pairs (p, s) for 
peP' and s G S' such that ps G X. As in any undirected graph, a connected 
component of G is a maximal set of vertices connected by paths. 

The following result is proved in [5] in the case of a Sturmian set (Lemma 
6.3.3). 

Proposition 6.15 Let F be a tree set, let X C F be a bifix code and let G be 
the incidence graph of X . 

(i) The graph G is acyclic. 

(ii) The intersection of P' (resp. S 1 ) with each connected component of G is 
a suffix (resp. prefix) code. 

Proof. We call a path reduced if it does not use equal consecutive edges. Let 
us prove by induction on n > 1 that if s G S' and ( 6 5' (resp. p £ P' and 
q G P') are connected by a reduced path of length 2n in G, then s, t are not 
prefix-comparable (resp. p, q are not suffix-comparable). This proves both (i) 
and (ii). 

The property holds for n = 1. Indeed, a reduced path of length 2 from s G S' 
to t G S' has the form s,q,t with qs,qt G X. Since the path is reduced, s ^ t 
and since X is prefix, s and t are not prefix-comparable. The proof for prefixes 
is similar. 

Let n > 2. A path of length 2n from s G S' to t G S' is a sequence 

Vi,Ul,V2, . . . ,U n ,V n+ i 

with s — v\ and t = v„+i such that the In words defined for 1 < i < n by 

X2i-1 = UiVi, X 2 i = U t V i+ l 

are in X. Moreover, since the path is reduced, one has Xj ^ Xj+i for 1 < j < 2n. 

Arguing by contradiction, assume that v\ and w n +i are prefix-comparable. 
We may assume that v\ is a prefix of i>„+i. The other case is symmetric. Two 
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elements of the set U — {u\, . . . , u n } are connected by a path of length at most 
2n — 2 (using elements of {v2, • ■ •«„}). Thus, by induction hypothesis, U is a 
suffix code. Similarly, the elements of the set V = {vi, . . . ,v n } are connected 
by a path of length at most 2n — 2 (using elements of {ui, . . . u n -i})- Thus 
V is a prefix code. By Corollary 14.81 the graph Gu,v{£) is acyclic. But since 
v\ is a prefix of we have u n v\ G F and thus Gu,v(s) contains the cycle 

Ui , Vi , U2, . ■ ■ , v n , u n , Vi , a contradiction. ■ 

Let X be a bifix code and let P be the set of proper prefixes of X. Consider 
the equivalence Ox on P which is the transitive closure of the relation formed 
by the pairs p,q G P such that ps,qs G X for some s € A + . Such a pair 
corresponds, when p, q ^ 1, to a path (p, s, q) in the incidence graph of X. Thus 
a class of Ox is either reduced to the empty word or it is the intersection of P \ 1 
with a connected component of the incidence graph of X. 

The following property relates the equivalence Ox with the right cosets of 
H = (X). It is Proposition 6.3.5 in [5]. 

Proposition 6.16 Let X be a bifix code, let P be the set of proper prefixes of 
X and let H be the subgroup generated by X . For any p,q G P, p = q mod Ox 
implies Hp = Hq. 

Let A = (P, 1, 1) be the literal automaton of X*. We show that the equiva- 
lence Ox is compatible with the transitions of the automaton A in the following 
sense. 

The following is proved in [5] (Lemma 6.3.6 and Lemma 6.4.2) in the case 
of a Sturmian set F. 

Proposition 6.17 Let F be a tree set. Let X C F be a bifix code and let 
P be the set of proper prefixes of X. Let p,q G P and a G A be such that 
pa, qa G P U X . Then in the literal automaton of X* , one has p = q mod Ox if 
and only if p ■ a = q ■ a mod Ox ■ 

Proof. Assume first that p = q mod Ox- Let (uq, v±, u±, . . . , v n , u n ) be a path in 
the incidence graph G of X with p = uq, u n — q. The corresponding words in X 
are uqVi,uiVi,uiV2, ■ ■ ■ , u n v n . We may assume that the words Ui are pairwise 
distinct, and that the Vi are pairwise distinct. Moreover, since pa, qa G P U X 
there exist words v, w such that pav, qaw G X. Set vq — av and v n +i = aw. 

By Proposition ^. 15| the set U — {uo, u\, . . . , u n } is a suffix code. Let ai be 
the first letter of Vi for < i < n + 1 and set Vi = aiv[. 

Then in the graph Gu,a{£), there is a cycle (ao, uq, a\, . . . , a n , u n , a n +i). 
Since U is a suffix code and since F satisfies the tree condition, this forces 
a, = a for 1 < i < n. 

Assume first that pa, qa G P. Then (uoa, v[, u\a, . . . , v' n , u n a) is a path from 
pa to qa in G. This shows that pa = qa mod Ox ■ 

Next, suppose that pa € X and thus that vq = a. By Proposition 16 . 1 5l the 
set V — {vq, vi,..., v n+ i\ is a prefix code. Since vq = a and each V{ begins with 
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a, this forces Vi = a for < i < n + 1. In particular w = e and thus qa € X. 
Thus p ■ a — q ■ a. 

Conversely, if pa = qa mod Ox, assume first that pa, qa £ P. Then there is 
a path (uq, vt,..., v n , u n ) in G with ug = pa and u n = qa. By Proposition l6.I5l 
the set V = {vi, . . . ,v n } is a prefix code. Let a; be the last letter of Ui for 
< i < n and set it, = u^aj. Then in the graph Ga,v{s), there is a cycle 
(a, Ui, ai,U2, • • • , a n ,v n , a). Since F satisfies the strong tree condition, this forces 
a-i = a for 1 < i < n. Thus (p, avi,Ui, . . ■ , q) is a path in G, showing that 
p = q mod 9x ■ 

Finally, if pa, qa £ X , then (p, a, q) is a path in G and thus p = q mod 



6.4 Coset automaton 

Let F be a tree set and let X c F be a bifix code. We introduce a new 
automaton denoted Bx and called the cosei automaton of X. Let i? be the set 
of classes of Ox with the class of 1 still denoted 1. The coset automaton of X 
is the automaton Bx — (R, 1,1) with set of states R and transitions induced 
by the transitions of the literal automaton A — (P, 1,1) of X* . Formally, for 
r,s £ R and a £ A, one has r ■ a = s in the automaton B if there exist p in the 
class r and q in the class s such that p ■ a = q in the automaton „4. 

Observe first that the definition is consistent since, by Lemma 16.171 if p • a 
and p' ■ a are nonempty and p,p' are in the same class r, then p ■ a and p' ■ a 
are in the same class. Since the class p • a is uniquely defined, the automaton is 
indeed deterministic. 

Observe next that if there is a path from p to p' in the automaton A labeled 
w , then there is a path from the class r of p to the class r' of p' labeled w in 




Figure 6.1: The automaton Bx- 



Example 6.18 For the code X of Example 15.241 the automaton Bx has three 
states. State 2 is the class containing b, and state 3 is the class containing 
ba. The bifix code generating the submonoid recognized by this automaton is 
Z = aU b(ab*a)*b. Observe that the word bb is in Z* but it is not in X*. 

The following result shows that the coset automaton of X is the Stallings 
automaton of the subgroup generated by X. 

Lemma 6.19 Let F be a tree set, and let X C F be a bifix code. The coset au- 
tomaton Bx is reversible and describes the subgroup generated by X . Moreover 
X C Z , where Z is the bifix code generating the submonoid recognized by Bx- 
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Proof. Let A = (P, 1, 1) be the literal automaton of X and set Bx = {R, 1, 1). 
By Proposition |6H7l the automaton Bx is reversible. 

Let Z be the bifix code generating the submonoid recognized by Bx- To 
show the inclusion X C Z, consider a word x G X. There is a path from 1 to 1 
labeled x in A, hence also in Bx- Since the class of 1 modulo Ox is reduced to 
1, this path in Bx does not pass by 1 except at its ends. Thus x is in Z. 

Let us finally show that the coset automaton describes the group H = (X). 
By Proposition 16. 101 the subgroup described by Bx is equal to (Z). Set K = 
(Z). Since X C Z, we have H C K . To show the converse inclusion, let us 
show by induction on the length of w G A* that if, for p,q G P, there is a 
path from the class of p to the class of q in Bx with label w then Hpw = Hq. 
By Proposition 16 . 1 61 this holds for w — 1. Next, assume that it is true for w 
and consider wa with a G A. Assume that there are states p,q,r G P such 
that there there is a path from the class of p to the class of q in Bx with 
label w, and an edge from the class of q to the class of r in Bx with the 
label a. By induction hypothesis, we have Hpw = Hq. Next, by definition of 
Bx, there is an s = q mod Ox such that s ■ a = r mod If € P, then 
s • a = sa, and by Proposition 16.161 we have Hs = Hq and Hsa = Hr. Thus 
Hpwa — Hqa = Hsa = Hr. Otherwise, sa € X and s ■ a — r — 1 because the 
class of 1 is a singleton. In this case, Hsa = H = Hr. This property shows that 
if z £ Z, then Hz = H, that is z £ H. Thus Z C H and finally H = K. ■ 

6.5 Proof of the Basis Theorem 

We first prove the following statement which is the last assertion of Theorem l6.H 

Proposition 6.20 Let F be a tree set. Let X C F be a finite F -maximal bifix 
code. Then (X) f] F = X* n F. 

Proof. We have X* n F C (X) n F. To show the converse inclusion, consider 
the bifix code Z generating the submonoid recognized by the coset automaton 
Bx associated to X. 

Let us show that Z P\ F = X. By Lemma 16.191 we have X C Z and thus 
X C Z <~) F. Since X is an F-maximal bifix code, this implies that X = Z P\ F. 

Since any reversible automaton is minimal and since the automaton Bx is 
reversible by Lemma T6.191 it is equal to the minimal automaton of Z* . Let K 
be the subgroup generated by Z. By Proposition ^. 81 we have K n A* = Z* . 

This shows that 

(x)nFcKnF = KnA*nF = z*nF = x*nF. 

The first inclusion holds because X C Z implies (X) C K. The last equality 
follows from the fact that if z\ • • • Z n G F with z\, . . . , z n G Z , then each Zi is in 
F hence in Z n F = X. Thus (X) CI F C X* n F, which was to be proved. ■ 

We denote by A° the free group on A. 
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Proof of Theorem 16.11 Assume first that X is an F-maximal bifix code of F- 
degree d. Let P be the set of proper prefixes of X. Let H be the subgroup 
generated by X. 

Let u £ F be a word such that Sx(u) = d, or, equivalently, which is not an 
internal factor of X. Let Q be the set of suffixes of u which are in P. 

o u o 

o u o y o 

o q o 

r 

o o 

Figure 6.2: A word y £ Rf{u). 

Let us first show that the cosets Hq for q £ Q are disjoint. Indeed, HpDHq ^ 
implies Hp = Hq. Any p,q £ Q are comparable for the suffix order. Assuming 
that q is longer than p, we have q — tp for some t £ P. Then Hp = Hq implies 
Ht = H and thus t £ H n F. By Proposition 16.201 since F is a tree set, this 
implies t £ X* and thus t = e. Thus p = q. 

Let 

V = {v £ A° | Qv c HQ} . 

For any v £ V the map p i— > g from Q into itself defined by S -ffg is a 
permutation of Q. Indeed, suppose that for p,q £ Q, one has pv,qv £ Hr for 
some r £ Q. Then rv^ 1 is in Hp n -ffg and thus p = q by the above argument. 

The set V is a subgroup of A . Indeed, 1 £ V. Next, let v g V. Then 
for any q £ Q, since w defines a permutation of Q, there is a p £ Q such that 
pv £ Hq. Then £ Hp. This shows that v^ 1 £ V. Next, if v,w £ V, then 
Qvw C HQw C -ffQ and thus vw £ V . 

We show that the set Rf(u) is contained in V. Indeed, let q £ Q and 
y £ Rf(u). Since q is a suffix of u, qy is a suffix of uy, and since uj/ is in F 
(by definition of Rp{u)), also is in F. Since X is an i^-maximal bifix code, 
it is an F-maximal prefix code and thus it is right F-complete. This implies 
that qy is a prefix of a word in X* and thus there is a word r £ P such that 
qy £ X*r. We verify that the word r is a suffix of u. Since y £ Rf{u), there 
is a word y' such that uy — y'u. Consequently, r is a suffix of y'u, and in fact 
the word r is a suffix of u. Indeed, one has \r\ < \u\ since otherwise u is in the 
set I(X) of internal factors of X, and this is not the case. Thus we have r £ Q 
(see Figure HO]) . Since X* C i? and r £ Q, we have gy G ifQ. Thus y £ V. 

By condition (ii), the group generated by Rf(u) is A°. Since Rf(u) C V, 
and since V is a subgroup of A°, we have V — A° . Thus Qui C iJQ for any 
w £ A°. Since 1 £ Q, we have in particular w £ HQ. Thus A° = HQ. Since 
Card(Q) = d 1 and since the right cosets Hq for q £ Q are pairwise disjoint, 
this shows that H is a subgroup of index d. By Theorem 13.41 and in view of 
Schreier's Formula, X is a basis of H . 

Assume conversely that the finite bifix code X C F is a basis of the group 
# = (X) and that (A) has index d. Since A is a basis, by Schreier's Formula, 
we have Card(A) = (k — l)d + 1, where k — Card(A). The case k = 1 is 
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straightforward; thus we assume k > 2. Since F is uniformly recurrent, by 
Theorem 4.4.3 in [5], there is a finite _F-maximal bifix code Y containing X. 
Let e be the F-degree of Y. By the first part of the proof, Y is a basis of a 
subgroup K of index e of ^4°. In particular, it has (k — l)e + 1 elements. Since 
X C Y, we have (fc — l)tf + 1 < (k — l)e + 1 and thus d < e. On the other hand, 
since H is included in K, d is a multiple of e and thus e < d. We conclude that 
d = e and thus that X = Y. m 
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